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Abstract 

During somatic differentiation, physiological DNA double-strand breaks (DSB) can drive programmed genome 
rearrangements (PGR), during whicli DSB repair pathways are mobilized to safeguard genome integrity. Because of their 
unique nuclear dimorphism, ciliates are powerful unicellular eukaryotic models to study the mechanisms involved in PGR. At 
each sexual cycle, the germline nucleus is transmitted to the progeny, but the somatic nucleus, essential for gene 
expression, is destroyed and a new somatic nucleus differentiates from a copy of the germline nucleus. In Paramecium 
tetraurelia, the development of the somatic nucleus involves massive PGR, including the precise elimination of at least 
45,000 germline sequences (Internal Eliminated Sequences, lES). IBS excision proceeds through a cut-and-close mechanism: 
a domesticated transposase, PiggylVlac, is essential for DNA cleavage, and DSB repair at excision sites involves the Ligase IV, 
a specific component of the non-homologous end-joining (NHEJ) pathway. At the genome-wide level, a huge number of 
programmed DSBs must be repaired during this process to allow the assembly of functional somatic chromosomes. To 
understand how DNA cleavage and DSB repair are coordinated during PGR, we have focused on Ku, the earliest actor of 
NHEJ-mediated repair. Two Ku70 and three Ku80 paralogs are encoded in the genome of P. tetraurelia: Ku70a and Ku80c are 
produced during sexual processes and localize specifically in the developing new somatic nucleus. Using RNA interference, 
we show that the development-specific Ku70/Ku80c heterodimer is essential for the recovery of a functional somatic 
nucleus. Strikingly, at the molecular level, PiggyMac-dependent DNA cleavage is abolished at lES boundaries in cells 
depleted for Ku80c, resulting in lES retention in the somatic genome. PiggylVlac and Ku70a/Ku80c co-purify as a complex 
when overproduced in a heterologous system. We conclude that Ku has been integrated in the Paramecium DNA cleavage 
factory, enabling tight coupling between DSB introduction and repair during PGR. 
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Introduction 

DNA double strand breaks (DSBs) are among the most 
deleterious DNA lesions: if left unrepaired, a single DSB may 
trigger cell death, while incorrect repair can give rise to 
chromosome rearrangements [1]. Cells rely on two major 
pathways to repair DSBs. Homologous recombination (HR) uses 
a homologous template to restore the sequence of the broken 
chromosome, while non-homologous end joining (NHEJ) proceeds 
through the ligation of free DNA ends. Even though they can be 
very toxic, programmed DSBs are obligatory intermediates in 
essential biological processes, such as meiosis or acquired immune 



response. During meiosis, the Spoil endonuclease cleaves 
DNA and DSB repair is carried out by HR [2]. In addition to 
favoring the exchange of parental alleles, HR ensures that 
homologous chromosomes are correctly paired before they are 
segregated during the first meiotic division. During lymphocyte 
differentiation, programmed genome rearrangements (PGR) 
mediated through V(D)J recombination generate the 
large diversity of immunoglobulin genes [3]. During V(D)J 
recombination, the domesticated transposase RAGl, associat- 
ed with its partner RAG2, cleaves specific recombination sites. 
The resulting DSBs are repaired through classical NHEJ (C- 
NHEJ). 
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Author Summary 

DNA double-strand breaks (DSBs) are potential threats for 
chromosome stability, but they are usually repaired by two 
major pathways, homologous recombination or non- 
homologous end joining (NHEJ). DSBs can also be essential 
during physiological processes, such as the programmed 
removal of germline sequences that takes place in various 
eukaryotes, including ciliates, during somatic differentia- 
tion. We use the ciliate Paramecium tetraurelia as a 
unicellular model to study how DNA breakage and DSB 
repair are coordinated during programmed genome 
rearrangements. In this organism, assembly of the somatic 
genome involves the elimination of ~25% of germline 
DNA, including the precise excision of thousands of short 
Internal Eliminated Sequences (IBS) scattered along germ- 
line chromosomes. A domesticated piggyBac transposase, 
PiggyMac, is required for double-strand DNA cleavage at 
lES ends and lES excision sites are very precisely repaired 
by the NHEJ pathway. Here, we report that a specialized Ku 
heterodimer, specifically expressed during programmed 
genome rearrangements, is an essential partner of 
PiggyMac and activates DNA cleavage. We propose that 
incorporation of DSB repair proteins in a pre-cleavage 
complex constitutes a safe and efficient way for Parame- 
cium to direct thousands of programmed DSBs to the 
NHEJ pathway and make sure that somatic chromosomes 
are assembled correctly. 

A critical step in C-NHEJ is the binding of the Ku70/Ku80 
heterodimer to broken DNA ends [4] . Upon binding, Ku protects 
DNA ends from extensive resection [5] and, together with its 
facultative partner DNA-PKcs, facilitates the synapsis of two 
broken ends. Following recruitment of DNA processing enzymes, 
the Ligase IV-Xrcc4 complex mediates the joining of DNA ends. 
An alternative end joining pathway, referred to as alt-NHEJ (or 
MMEJ, for microhomology-dependent end joining), has been 
reported [6] . This poorly characterized pathway is independent of 
Ku and, to some extent, of Ligase IV. Because of the absence of 
Ku, alt-NHEJ involves limited 5' to 3' resection of broken DNA 
ends and generates deletions at DSB repair sites, which often 
involve microhomologies. When DSBs are repaired through HR, 
5' to 3' resection also takes place, but two steps can be 
distinguished: initial short-range end resection relies on the same 
factors as alt-NHEJ [7], while subsequent long-range resection 
generates the long 3' single strand that will invade a homologous 
DNA duplex [8]. 

Ciliates provide extraordinary models to study the interplay 
between DNA cleavage and DSB repair during PGR [9]. In these 
unicellular eukaryotes, two kinds of nuclei coexist in the same 
cytoplasm. The highly polyploid somatic macronucleus (MAC) is 
essential for gene expression but it is destroyed at each sexual 
cycle, while the diploid micronucleus (MIC) undergoes meiosis 
and transmits the germline genome to the new MIC and MAC of 
the next generation. In Paramecium, massive PGR take place in 
the new developing MAC, while the genome is amplified from 2n 
to 800n [10]. These PGR consist in the elimination of two types of 
germline-specific DNA. Regions of up to several kbp in length, 
often containing repeated sequences, are eliminated in a 
heterogeneous manner, leading to chromosome fragmentation or 
intra-chromosomal deletions. In addition, thousands of single- 
copy, short and non-coding Internal Eliminated Sequences (lES) 
are excised precisely. Because 47% of genes are interrupted by at 
least one lES in the germline genome [1 1], the precise excision of 
lESs is essential for the assembly of functional genes in the new 



MAC and the survival of the sexual progeny. Paramecium lESs 
are invariably flanked by one TA dinucleotide on each side and 
litde additional information can be found in their nucleotide 
sequence, which raises the question of how these sequences are 
recognized and targeted for excision. In fact, the excision of an 
estimated one-third of Paramecium lESs is controlled maternally 
through a sequence homology-dependent mechanism [12]. For 
these so-called maternally-controlled lESs, a genome-wide com- 
parison of the germline and rearranged versions of the genome 
involves non-coding RNAs [13,14]. This epigenetic control drives 
the trans-generational inheritance of rearrangement patterns, from 
the old to the new MAC. 

lES excision proceeds through a two-step "cut-and-close" 
mechanism. A domesticated piggyBac transposase, PiggyMac 
(Pgm), is essential to introduce the DNA cleavages that initiate 
the reaction, generating 4-bp staggered DSBs centered on the 
conserved TA at each lES boundary [15]. Following lES release, 
precise DSB repair is carried out through the C-NHEJ pathway 
and leaves a single TA at the lES excision site [16]. Providing their 
length allows enough DNA flexibility, the excised linear lESs are 
circularized, also through the C-NHEJ pathway, before they are 
actively degraded. During DSB repair, the flanking broken ends 
are thought to anneal through the pairing of the complementary 
TAs carried by their 4-base 5' overhangs, and undergo limited 5' 
and 3' processing [17]. The final ligation step is mediated by the 
NHEJ-specific ligase complex, Ligase IV and its partner Xrcc4, 
both of which are essential for PGR. When LIG4 or XRCC4 genes 
are knocked down, Pgm-dependent DSBs are introduced normal- 
ly, but they accumulate in the developing new MAC, which 
correlates with severely compromised DNA amplification and 
impairs the recovery of viable progeny [16]. 

Next-generation sequencing of the non-rearranged genome of 
Paramecium tetraurelia led to the identification of at least 45,000 
lESs [1 1]. Therefore, a huge number of programmed DSBs have 
to be repaired precisely during MAC development. In the present 
study, we have addressed the question of how the C-NHEJ 
pathway is recruited to lES excision sites to carry out efficient and 
precise DSB repair. We have focused our analysis on the Ku 
heterodimer, which is the most upstream actor of the C-NHEJ 
pathway. Two KU70 and three KU80 genes were identified in the 
somatic genome [16]. We report here that development-specific 
KU genes are essential for PGR. Surprisingly, we demonstrate that 
Ku is required for the introduction of programmed DSBs at lES 
boundaries, and provide evidence that Ku interacts functionally 
and physically with Pgm. We propose that the Ku70a/Ku80c 
heterodimer forms a complex with Pgm and activates DNA 
cleavage during PGR in P. tetraurelia. 

Results 

Development-specific KU genes in P. tetraurelia 

Two genes encoding Ku70 homologs, KU70a and KU70b, 
were identified in the macronuclear genome assembly of P. 
tetraurelia (Figxire lA and [16]). These two closely related copies 
arose from a recent whole genome duplication (WGD) that took 
place during the evolution of this species [18], and are referred to 
as ohnologs. We also identified three homologs of the human 
KU80 gene (Figure lA). KU80a and KUSOb are ohnologs from 
the recent WGD, while the more distant KU80c diverged after an 
earlier, "intermediate", WGD. 

Sexual processes in Paramecium may occur in two different 
ways: during conjugation, following the mixing of two reactive 
partners with compatible mating types, or during a self-fertilization 
process called autogamy, in which MIC meiosis is induced upon 
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Figure 1. KU70and KUSOgenes in P. tetraurelia. (A) Diagram of the 
WGD relationships between KU70 or KU80 genes in P. tetraurelia. The % 
of identity between genes (nt) and proteins (aa) are indicated along the 
arrows connecting two genes. (B) Transcription profiles of KU70 and 
KU80 genes during an autogamy time-course of strain 51, as 
determined by high-throughput RNA-seq. V: vegetative cells; S: starved 
or meiotic cells with intact parental IVIAC; TO: 50% of cells with 
fragmented MAC; the following time-points refer to hours after TO [51]. 
On the vertical axis, FPKM represents the number of fragments per 
gene kb per million of fragments that were uniquely mapped on the 
genome. (C) Detection of KU70 and KU80c mRNA during autogamy, 
through northern blot hybridization. Control RNAi: RNAi against the 
nonessential ND7 gene [43], which encodes an exocytosis protein. V: 
vegetative cells. The times refer to hours after TO, the time at which 50% 
of cells have a fragmented IVIAC (Figure SI A). 
doi:10.1371/journal.pgen.1004552.g001 
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Figure 2. Nuclear localization of GFP-Ku fusions during 
autogamy. Cells were microinjected with fusion transgenes expressing 
GFP-Ku70a (panels a, b, c) and GFP-Ku80c (panels d, e, f) under the 
control of their respective transcription signals. In autogamous cells, 
developing MACs are indicated by yellow arrowheads, the other DAPI- 
stained nuclei are fragments from the old vegetative MAC. For each 
protein, the GFP fluorescence sometimes concentrated in nuclear foci 
of unclear biological significance (see cells in panels c and f, and 
enlarged inserts on the right). In this particular experiment, expression 
of the GFP fusions had no significant effect on the recovery of viable 
post-autogamous progeny (87% progeny with functional new MACs for 
GFP-Ku70a, 90% for GFP-KuBOc). 
doi:10.1371/journal.pgen.1004552.g002 



the levels oi KU70 and KUSOc mRNAs do not decrease at later 
time-points after Pgm depletion, suggesting diat the completion of 
PGR is a signal for transcriptional switch-off. 



starvation in cells of a single mating type [19]. A microarray 
analysis of the P. tetraurelia transcriptome during autogamy 
revealed that KU70a and KUSOc are specifically induced during 
the development of the new MAC, when PGR take place [16,20]. 
The microarray data were confirmed by high-throughput 
sequencing of polyadenylated RNAs extracted during an autog- 
amy time-course (Figure IB, Arnaiz et al. in prep.), and by 
northern blot hybridization with specific probes (Figures IC and 
3C, left panels; Malinsky et al. in prep.). After Pgm depletion, the 
transcription o{ KU70 and KUSOc is switched on at the expected 
time-point during autogamy (Figure IC, right panels), indicating 
that KU genes are not induced as a response to Pgm-iiiduced 
DSBs, but more likely as part of a general transcription program 
during MAC development. Moreover, in contrast to control cells, 



Ku70a and Ku80c localize in the developing new MAC 

The expression of KU70a and KUSOc is specifically induced 
during autogamy and reaches a peak when the developing new 
IVlACs start to be detected in the culture (T5 and Tl 1 time-points 
in Figure IB). Using N-terminal GFP fusions, we followed the 
cellular localization of Ku70a and Ku80c during autogamy. 
Transgenes expressing each fusion protein under the control of 
their respective endogenous promoters were microinjected into the 
MAC of vegetative cells. The resulting transformants were grown 
and starved to induce autogamy. Consistent with the transcrip- 
tome analysis, GFP fluorescence was stronger in autogamous cells 
with respect to vegetative cells for each fusion transgene (Figure 2). 
Both fusion proteins accumulated in the developing new MACs, 
suggesting a possible involvement of Ku70a and Ku80c in PGR 
during MAC development. 
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Ku70 and Ku80c are required for successful completion 
of autogamy 

To test the implication of the difTerent KU genes in MAC 
development, we knocked them down systematically by feeding 
wild-type P. tetraurelia cells on dsRNA-producing bacteria to 
induce RNA interference [21]. The very high percentage of 
identity between KU70a and KU70h made it impossible to design 
specific RNAi constructs for each individual gene. Therefore, we 
silenced them both together. In contrast, we designed gene-specific 
RNAi constructs for KU80a, KU80h and KU80c. Whenever 
possible, to make sure that the observed phenotypes would be 
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Figure 3. RNAi screen for essential KU genes during autogamy. 

(A) Survival of the post-autogamous progeny of cells submitted to 
different combinations of RNAi. Kp: autogamy in standard K. 
pneumoniae medium; iCL7: RNAi against ICL7, a nonessential gene that 
encodes an infraciliary lattice centrin [47]; XR: RNAi against XRCC4. RNAi 
experiments against KU80 genes were performed using gene-specific 
inserts KU80-a2, KU80-b2 and KU80-c2 (see Figure S2). For each 
condition, 30 to —140 post-autogamous cells were analyzed. Each bar 
represents the percentage of viable post-autogamous cells carrying a 
functional new MAC, for each condition. Error bars represent the Wilson 
score intervals (95% confidence level), which are appropriate for a small 
number of trials or for values close to an extreme probability. (B) DAPI- 
staining of developing MACs during RNAi against ND7 (control), KU70, 
KU80C, XRCC4/UG4 and LiG4 + KU80c. Developing MACs are indicated by 
yellow arrowheads. Upon UG4 or XRCC4 RNAi, developing MACs exhibit 
faint DAPI staining, which correlates with a defect in DNA amplification 
[16]. (C) Northern blot hybridization of total RNA during a control time 
course experiment (WD7 RNAi) and in a KU80c RNAi. V: vegetative cells; 
TO: 60% of cells with fragmented MAC; other time-points refer to hours 
following TO (Figure SIB). 
doi:1 0.1 371/journal.pgen.1 004552.g003 



attributable to the silencing of each targeted gene, we used RNAi 
constructs homologous to two different regions in each KU80 gene 
(Figure S2). After three days of starvation in each silencing 
medium, individual autogamous cells were transferred to standard 
growth medium and allowed to resume vegetative growth. The 
survivors were grouped in three categories: (i) those that were able 
to undergo a new round of autogamy following a few divisions, 
most likely because they had regenerated their old MAC, were not 
considered as bona fide post-autogamous progeny [16]; (ii) slow- 
growing survivors were counted as progeny with a defective new 
MAC; (iii) those survivors that remained in the vegetative state 
when starved after a few divisions were classified as fully viable 
post-autogamous progeny. Only the recovery of survivors from the 
third category indicated that the silenced cells had been able to 
form a functional new MAC (Figure 3A). 

When KU80a or KU80h were knocked down, either individ- 
ually or together, the progeny exhibited good survival rates when 
compared to a control RNAi against the nonessential ICL7 gene, 
or with respect to cells that underwent autogamy in standard 
medium (Figure 3A). In contrast, RNAi against KU80c or KU70 
yielded only 10 to 30% viable sexual progeny, showing that the 
developmentally-induced KU80c gene and one or both KU70 
genes are required for the completion of autogamy. Interestingly, 
we observed that new MACs develop normally at the cytological 
level in cells silenced for KU70 or KU80c (Figure 3B), while cells 
silenced for LIG4 or XRCC4 were previously shown to harbor 
small new developing MACs that remain faintly stained with 
DAPI, correlating with a block in DNA amplification [16]. Bright 
DAPI staining of the new IVIACs indicates that DNA amplification 
takes place normally in Ku-depleted cells. 

Residual survival was observed reproducibly following Ku70 or 
Ku80c depletions, in contrast to the severe lethality phenotype of 
Xrcc4 depletion (Figure 3A). This phenotypic difference, which 
was also noted during mouse embryonic development [22], may 
be explained by the fact that Ku acts upstream of Ligase IV/ 
Xrcc4 during C-NHEJ. Upon Ku depletion, alternative DSB 
repair pathways could restore chromosome integrity to some 
extent, while Ligase IV/Xrcc4 depletion would constitute a dead- 
end for DSBs, committed to C-NHEJ in the presence of Ku. We 
noted, indeed, that normal DAPI staining of the developing MAC 
is restored in Ligase IV-depleted cells by simultaneously depleting 
Ku80c (Figure 3B). At this stage of our study, this observation is 
consistent with the hypothesis that activation of alternative Ku- 
independent DSB repair pathways may rescue DNA amplification 
in the new IVlAC, therefore allowing the recovery of low amounts 
of post-autogamous survivors. An alternative explanation for the 
residual survival observed upon Ku80c depletion could also be 
that KU80c transcripts are only partially degraded by RNAi, as 
testified by the restoration of significant amounts of full-length 
niRNA at T30 and T40 (Figure 3C, top lane). 

KU70 and KUBOc are essential for the programmed 
elimination of germline DNA 

We showed previously that the NHEJ-specific Ligase IV/Xrcc4 
ligation complex joins broken DNA ends at lES excision sites [16]. 
Strikingly, after Ligase IV depletion, unrepaired broken ends 
remain very stable throughout autogamy, suggesting that Ku 
protects them against degradation. To gain further insight into the 
role of Ku during lES excision, we performed a molecular analysis 
of PGR in Ku-depleted cells. As predicted from the known role of 
Ku during C-NHEJ, we expected that DSBs would be introduced 
normally at lES boundaries, but that the broken DNA ends would 
not be directed to the C-NHEJ pathway. 
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PCR primers (see Table SI). 
doi:1 0.1 371/Journal.pgen.1 004552.g004 



We first focused our analysis on cells depleted in KuSOc. Using 
Southern blot hybridization, we monitored the excision of one 
particular lES, lES 51G4404 from the surface antigen G*^ gene 
[23], during an autogamy time-course (Figure 4A). In a control 
RNAi (left panel), a major band corresponded to the lES 
molecules from the old MAC and the newly rearranged molecules 
from the developing new MAC. A higher molecular weight species 
corresponding to the non-excised lES"'" form was transientiy and 
only barely detectable during MAC development (lanes 2' to 4), 
and was completely absent at late time-points as a result of lES 
excision. In contrast, the lES^ form was clearly amplified during 
autogamy after KuSOc depletion (right panel). Using a related 
strategy, we tested chromosome fragmentation at one particular 
fragmentation site located downstream of the G'^ gene (Fig- 
ure 4B). We observed that non-rearranged molecules are ampli- 
fied during MAC development in Ku80c-depleted cells, which 
indicates that Ku is also required for the elimination of the 
germline DNA that is associated with chromosome fragmentation. 

The amplification of non-rearranged DNA indicates that PGR 
do not proceed normally after KuSOc depletion. However, because 
of the presence of old MAC DNA in our samples, the above 
experiment could not detect whether residual rearrangements had 
taken place in the new MAC. We could circumvent this problem, 
because the strain used in this experiment (5 1 AA) harbors a wild- 
type germline genome, but carries a somatic deletion of the 
nonessential surface antigen gene During autogamy, all lESs 
are excised normally from the A^^ gene before the whole locus is 
deleted from the new somatic MAC [24]. Therefore, in the 51AA 



variant, all the IES~ molecules originating from this locus may be 
attributed to de novo lES excision from the new MAC. Using PCR 
primers hybridizing in the flanking sequences, excised (lES ) and 
non-excised (lES"^) molecules were readily detected in a control 
RNAi experiment, for lESs belonging to different classes 
(Figwe 4C, left panel): short (51A1835: 28 bp), intermediate 
(51A4404: 77 bp) or long (51A2591: 370 bp), maternally 
(51A2591) or non-maternally controlled (51A1835 and 
51A4404). This stands in sharp contrast to the complete absence 
of de novo lES excision junctions after Ku depletion (right panel). 
Likewise, using lES-specific internal divergent primers, we 
followed the appearance of excised lES circular molecules during 
the autogamy of control and KUSOc-sienced cells, but no circle 
junctions were detected following Ku80c depletion (Figure 4D). 

To extend our analysis to the other KU genes, we submitted 
small-scale cultures of 51AA cells to RNAi against KU70 or 
individual KU80 genes, and tested their ability to complete lES 
excision during autogamy. When KU80a and KU80b were 
knocked down, together or separately, rearranged IES~ molecules 
appeared at day 2 of starvation, with the same timing as in a 
control RNAi (Figure S3). In contrast, the appearance of de novo 
lES excision junctions was strongly impaired upon Ku70 
depletion, similar to Ku80c depletion. Taken together, our 
molecular data indicate that Ku70 and the development-specific 
Ku80c are essential for the recovery of both precise chromosomal 
junctions and excised lES circles during PGR. Moreover, after 
Ku70 or Ku80c depletion, the non-rearranged version of the 
genome is amplified in the developing MAC, a strikingly difierent 
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phenotype from that of Ligase IV or Xrcc4 depletions [16], but 
quite similar to Pgm depletion [15]. 

No DNA breaks can be detected by LMPCR in cells 
depleted for Ku80c 

The absence of precise de novo lES excision junctions after 
Ku70 or Ku80c depletions could either reflect a problem in DSB 
repair, as established for Ligase IV depletion, or an inhibition of 
DNA cleavage, as demonstrated for Pgm depletion. Because Ku is 
essential for C-NHEJ in all organisms, a defect in end-joining was 
expected after Ku depletion in P. tetraurelia. However, the 
observation that non-excised lESs were amplified in the new MAC 
of cells depleted for Ku70/Ku80c raised the issue of whether 
DSBs were actually introduced at lES bovmdaries. We therefore 
used a sensitive ligation-mediated PGR assay (LMPCR) to search 
for DSBs at lES boundaries after Ku depletion. As previously 
published [17], for those lESs that were tested, free broken DNA 
ends at lES boundaries were detected at early autogamy time- 
points in cells subjected to a control RNAi (Figure 5A). DSBs 
disappeared later on during MAC development, indicative of 
efficient repair. After Ku80c depletion, no specific broken ends 
were detected on the MAC side or the lES side of the DSB 
(Figure 5A). 

An important control was to verify whether Pgm is produced 
and imported normally into the developing MAC upon Ku 
depletion. Cells expressing a Pgm-GFP fusion under the control of 
the endogenous PGM transcription signals were subjected to a 
control RNAi or to RNAi against KU80c (Figure 6). In the 
control, the fusion protein appeared in the developing new MAC 
at day 2 of starvation, and formed small foci until day 3 
(Figure 6A). Pgm-GFP foci disappeared at day 4, which 
corresponds to the time when most PGR are completed in control 



ceUs (see Figure S3). In a KU8(k RNAi, the Pgm-GFP fusion also 
localized specifically to the developing new MAC (Figwe 6B), 
indicating that Ku is not required for the nuclear import of Pgm. 
However, Ku80c depletion triggered a dramatic increase in the 
nuclear amount of the Pgm-GFP fusion (Figure 6B), which 
parallels the accumulation of the endogenous PGM mRNA 
observed upon KU80c RNAi in cells harboring no fusion 
transgene (Figure 3C, right panels). We also observed a striking 
difference in the subnuclear localization of Pgm, in a KU80c 
RNAi relative to the control. Indeed, at day 3, Pgm-GFP 
accumulated in large nuclear bodies, which were clearly detectable 
under differential interference contrast (DIG) and coincided with 
DAPI-free regions (Figure 6B, right panels). At day 4, GFP 
fluorescence was still high, but large nuclear bodies were no longer 
detectable. The apparently normal subnuclear organization of 
Pgm-GFP observed at day 4 correlated with the recovery of 
functional amounts of KUSOc mRNA at very late time-points 
(Figure 3C, top right panel). 

No detectable DSB repair intermediates in Ku-depleted 
cells 

In other organisms, Ku is known to protect broken DNA ends 
against degradation [1], and Ku depletion may reveal alternative 
DSB repair pathways, such as alt-NHEJ or HR, by allowing 5' to 
3' DNA end resection. Should resection occur upon Ku depletion 
in P. tetraurelia, the resected 5' DNA ends would not be 
appropriate substrates for the LMPCR assay displayed in 
Figure 5, because the linkers that were used only allow detection 
of DSBs with a specific geometry (4- or 3-base 5' overhangs). We 
first investigated whether alt-NHEJ might rescue Ku depletions, 
by repeating the PGR assays shown in Figure 4C with more 
distant primers hybridizing 1 kb away from the lES excision site. 



LMPCR: 



B 



Tdl tailing: 



^ccccccccccccc 



Control RNAi KUSOc RNAi 

123456 GATC 123456 



IES51A1835 
MAC end 

lES 51A2591 
MAC end 

lES 51A4404 
MAC end 

IBS 51G4404 
MAC end 

IES51G4404 
lES end 



Control RNAi KUSOc RNAi 

1 23456 GATC 1 23456 



lES 51A1835 
MAC end 

lES 51A2591 
MAC end 

lES 51A4404 
MAC end 




Figure 5. Detection of programmed DSBs at lES boundaries during autogamy. (A) LIVIPCR detection of DSBs at IVIAC or lES ends during 
autogamy of 51 AA cells subjected to RNAi against ICL7 (control) or KUSOc (same samples as in Figure 4). A Sanger DNA sequencing ladder provides 
size markers. On the diagram, the LMPCR linker is drawn as grey boxes and Paramecium DNA as black lines, with a black dot representing the 3' end 
generated by Pgm-dependent cleavage. In the KUSOc RNAi, the LIVIPCR signals at 51G4404 MAC ends are likely due to background DNA breaks 
generated at the MAC G^' locus during DNA extraction. This background is not detected for lESs of the /\^' gene, because this locus is absent from 
the old MAC. (B) TdT tailing of free 3'OH ends during autogamy of 51 AA cells subjected to RNAi against ICL7 (control) or KUSOc (same samples as in 
(A)). On the diagram, the potentially resected 5' end is represented by a dotted line. 
doi:l 0.1 371/journal.pgen.l 004552.g005 
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Figure 6. Nuclear accumulation of a Pgm-GFP fusion in KuSOc- 
depieted cells. Cells were microinjected with a PGM-GFP fusion 
transgene and one transformant was submitted to RNAi against ICL7 
(control: panel A) or KU80c (panel B). The progression of autogamy was 
monitored over a four-day starvation period (a and e: day 1; b and f: day 
2; c and g: day 3; d and h: day 4). To compare the intensities of GFP 
fluorescence, signals were acquired with the same exposure time, and 
identical window settings were applied to the image display using the 
ImageJ software (National Institute of Health). Developing IVlACs are 
indicated by yellow arrowheads. In the enlarged inserts shown on the 
right of each panel, the display settings were modified to highlight the 
Pgm-GFP nuclear foci. The white arrowheads in panel B point to the 
DAPI-free regions, in which overproduced Pgm-GFP accumulates 
following KU80c RNAi. In the control RNAi, 93% of post-autogamous 
progeny had a functional new MAC, while the KU80c RNAi yielded no 
progeny with a functional new MAC. 
doi:1 0.1 371 /journal.pgen.1 004552.g006 

This would have allowed us to detect alternative repair junctions 
with small deletions attributable to alt-NHEJ. Even under these 
conditions, however, no heterogeneous excision junctions could be 
detected (Figure S4). Second, because lES excision is concomitant 
with genome endoduplication, we reasoned that HR could use yet 
non-rearranged DNA molecules as templates to repair DSBs at 
lES excision sites: this would account for the amplification of lES'*" 
molecules that we observed in the new MAC (Figure 4A). During 
HR-mediated DSB repair, the free 3'OH ends resulting from 
Pgm-dependent DNA cleavage would not be degraded during 5' 
to 3' resection, and should be detectable through polynucleotidyl 
terminal transferase (TdT) tailing [17]. In a control RNAi, indeed, 
free 3'OH ends were detected at lES boundaries (Figure 5B), with 
the same timing as the DSBs observed using LMPCR (Figure 5A). 
However, no free 3'OH ends were found at the expected position 
in a KU80c RNAi (Figure 5B). Taken together, our data do not 
support the hypothesis tiiat, in cells depleted for Ku, broken lES 
excision sites are repaired through an alternative pathway 
involving 5' to 3' resection. Our results rather suggest that Ku is 
required for Pgm-dependent DNA cleavage itself 

Ku and Pgm form a complex in soluble cell extracts 

A functional interaction between Ku and Pgm to activate DNA 
cleavage at lES boundaries may rely on the formation of a protein 
complex containing both proteins. To investigate this hypothesis. 



MBP-Pgm 
Ku70a-HA 
HA-Ku80c 




MBP co-purif 



- HA-Ku80c 

- Ku70a-HA 

-- HA-Ku80c 

- Ku70a-HA 



B 



Ku70a-HA 
HA-Ku80c 
Pgm 



+ 
+ 



Input 

- + 

+ + 

+ + 

— Pgm 



a HA IP 



I- Pgm 



Input 



DNasel 
MBP-Pgm - 
HA-Ku80c + 
Ku70a 



+ + 
+ + 
- + 



- + + 
+ + + 

- - + 

MBP co-purif 



,- HA-Ku80c 



Figure 7. Physical interaction between Pgm and Ku. (A) Co- 
precipitation of HA-Ku80c and Ku70a-HA with MBP-Pgm from insect cell 
extracts (top panel), revealed on western blots using an anti-HA 
antibody. (B) Co-immunoprecipitation of Pgm with HA-Ku80c and/or 
Ku70a-HA from insect cell extracts, revealed on western blots using an 
anti-Pgm antibody. In A and B, the input proteins from each extract are 
displayed in the bottom panel. (C) The interaction between HA-Ku80c 
and Pgm is resistant to DNasel treatment. The co-precipitation 
experiment was performed as described in A, using MBP-Pgm, HA- 
KuSOc and 6His-Ku70a recombinant proteins produced from baculo- 
virus vectors. EDTA was removed from the lysis buffer and replaced by 
1 0 mM MgCl2. Half of the sample was treated with 40 [ig/mL of Dnase I 
during the 2-hr incubation with amylose-coupled magnetic beads. The 
presence of HA-Ku80c in the purified complexes was revealed on 
western blots using an anti-HA antibody. 
doi:10.1371/journal.pgen.1004552.g007 



HA-tagged versions of Ku70a or Ku80c were produced in a 
heterologous insect cell system, together with Pgm fused to the 
maltose-binding protein (MBP) at its N-terminal end. For each 
condition, the MBP-Pgm protein was precipitated from soluble cell 
extracts using amylose magnetic beads, and co-precipitation of the 
Ku subunits was monitored on western blots (Figure 7 A). We 
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observed that Ku7()a and Ku80c co-precipitated with MBP-Pgm, 
either individually or when the two subunits were co-expressed in 
the same cells. Control experiments confirmed that the enrich- 
ment in either Ku subunit is specific for the presence of MBP-Pgm 
in the extracts (Figure 7A). Reciprocally, we could co-immuno- 
precipitate Pgm with HA-tagged Ku from extracts of insect cells 
co-expressing both proteins, using magnetic beads coated with 
anti-HA antibodies (Figure 7B). The association of Pgm and Ku 
was resistant to DNase I (Figure 7C), suggesting that the formation 
of a Pgm/Ku complex does not depend upon the presence of 
DNA. Taken together, our data indicate that Pgm and Ku 
assemble in a higher-order complex in soluble cell extracts. 

Discussion 

A specific Ku heterodimer is required for MAC 
development as part of a developmental transcription 
program 

Gene duplication has been proposed to be a driver of genome 
evolution, allowing the sub-functionalization of duplicated genes 
and sometimes leading to the emergence of novel cellular functions 
[25]. In P. tetraurelia, the presence of two KU70 and three KU80 
genes has been the result of successive WGDs [18]. The KU80 
family provides a nice example of sub-functionahzation, with 
KU80a and KU80b being constitutively expressed throughout the 
life cycle, and KU80c exhibiting a characteristic induction pattern 
correlating with its specific function during MAC development. 
Alignment of the three Ku80 proteins reveals that Ku80c differs 
from Ku80a and Ku80b at several positions (Figure S5), some of 
which may lie on the exposed surface of the (x/p domain [26] and 
might possibly interact with additional partners. Future studies 
should address the question of whether the specific function of 
KU80c has resulted only from its overexpression or also involves a 
specialization of the protein in assisting PiggyMac-dependent 
genome rearrangements. The function(s) of the KU70 genes could 
not be investigated separately through RNAi. The Ku70a and 
Ku70b proteins are 98% identical (see Figure S5), with only 
conser\'ative amino acid changes in their a/fi and fi-barrel 
domains, which suggests that they perform similar functions. 
However, using RNA deep sequencing, we confirmed that the two 
recently duplicated KU70 genes exhibit distinct transcription 
patterns, with KU70a being overexpressed during MAC develop- 
ment. We propose that, similar to KU80c, KU70a might have 
been undergoing specialization to carry out an essential function in 
PGR. 

According to previous microarray hybridization analyses, 
successive transcription induction peaks (early, intermediate and 
late) were identified at the genome-wide level during autogamy 
[20]. KU80c and PGM belong to the same "intermediate" cluster 
of genes that are induced by the time PGR take place during MAC 
development, and are repressed at later time-points during 
autogamy. Our study shows that the transcription of PGM is 
induced normally after Ku80c depletion. However, while PGM 
expression decreases at late time-points in a control, after PGR are 
essentially completed, it is continuously turned on in cells depleted 
in KuSOc, while PGR are strongly inhibited. Reciprocally, after 
Pgm depletion, which blocks PGR, the transcription of KU80c is 
induced normally but is not switched off. Quite interestingly, 
normal progression of autogamy is observed at the cytological level 
upon Ku80c or Pgm depletion, indicating that cytological 
progression of MAC development (DNA amplification, segrega- 
tion of the new developing MACs into daughter cells) in 
Paramecium is uncoupled from the completion of PGR. Although 
alternative explanations might be proposed, the observation that 



similar transcriptional deregulation is observed at late autogamy 
time-points in KU80c and PGM knockdowns suggests that the 
cause of aberrant transcript accumulation for genes from the 
intermediate cluster is a failure to complete PGR. For instance, a 
transcriptional activator specific for the intermediate gene cluster 
may be expressed from an lES-containing gene in the developing 
MAC, and switched off as soon as lES excision has been 
completed. A regulatory mechanism relying on the retention of an 
lES overlapping a gene promoter has been shown to control the 
expression of a mating-type gene in P. tetraurelia [27]. In the 
cUiate Euploles rrassus, PGR also regulate the expression of a 
development-specific telomerase gene that is localized in the 
germline-restricted part of the genome and is, therefore, switched 
off naturally once it is eliminated [28] . The existence of feedback 
regulatory loops provides a nice illustration of how ciliates may 
take advantage of PGR to fine-tune gene expression during their 
sexual cycle. 

The unexpected role of Ku in Paramecium: Coupling of 
DNA cleavage and DSB repair during PGR 

Upon Ku70 or Ku80c depletions, we found that germline 
sequences are retained in the new MAC: both lES excision and 
chromosome fragmentation are inhibited, which confirms that the 
developmental-specific Ku70/Ku80c heterodimer plays an essen- 
tial role in PGR. We confirmed that PGM expression, which is 
required for the two types of genome rearrangements [15], is still 
induced in Ku-depleted cells and that Pgm still localizes to the 
developing new MACs. As discussed above, the persisting 
overproduction of Pgm at late autogamy time-points appears to 
be a consequence, and not the cause, of defective genome 
rearrangements in Ku-depleted cells. 

With regard to lES excision, no de novo precise excision 
junctions were detected upon Ku depletion, consistent with a 
defective C-NHEJ pathway. Neither did we detect any imprecise 
junction that may have resulted from alt-NHEJ. More surprisingly, 
we observed that the non-rearranged version of the genome is 
amplified in the new MAC, and that no DSBs with the expected 
geometry' can be detected at lES boundaries using a sensitive 
LMPCR molecular approach. Because lES excision starts after 3 
to 4 rounds of genome amplification in the new MAC [29], we 
considered the possibility that HR substitutes for end-joining 
during the repair of lES excision sites in Ku-depleted cells, which 
could restore lES^ chromosomes, supposing that a yet non- 
rearranged sister chromatid were used as a template. During HR- 
mediated repair, DSBs would l)c processed through 5' to 3' 
resection, which would make brok('n ends inappropriate substrates 
for LMPCR. IIowc\(;r, using a sensitive TdT-taUing assay, we 
obtained no evidence that HR intermediates are formed, based on 
the absence of detectable free 3' ends at lES boundaries. We 
therefore conclude that lESs are retained in the genome of Ku- 
depleted cells, as a consequence of defective programmed DNA 
cleavage at their boundaries. Our findings point to the participa- 
tion of the development-specific Ku70-Ku80c heterodimer in 
Pgm-dependent DNA cleavage, upstream of its likely function in 
C-NHEJ-mediated DSB repair. 

In another cUiate, Tetrahymena thermophila, lES excision is 
mediated by a Pgm homolog, Tpb2p, which is responsible for 
DNA cleavage at lES boundaries [30,31]. T. thermophila harbors 
one KU80 and two KU70 genes: Tpb2p-dependent DNA 
cleavage does occur in a TKU80A strain, but the resulting DSBs 
are not repaired, leading to an arrest in MAC development and to 
DNA loss in the new MAC [32]. These phenotypes, which are 
quite similar to those described for Ligase IV or Xrcc4 depletions 
in P. tetraurelia [16], are fully consistent with a classical scenario. 
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in which C-NHEJ factors are recruited to broken DNA ends after 
DNA cleavage at lES boundaries. Interestingly, the presence of 
Ku is not a prerequisite for DNA cleavage in T. thermophila, and 
Tku80p localization remains dispersed throughout the developing 
MAC when Tpb2p concentrates in large heterochromatin bodies, 
in which DNA elimination is thought to take place [32]. It is 
unclear, therefore, whether Ku interacts direcdy with Tpb2p 
during genome rearrangements in this cUiate. Together with the 
presence of a single KU80 gene in T. thermophila, the observed 
differences between the two cUiates suggest that specialization of 
KU80c in P. tetraurelia may have occurred after the divergence 
between Paramecium and Tetrahymerm. Noteworthy, lES excision 
is rather imprecise in T. thermophila, and heterogeneity at lES 
excision junctions could be attributed to variability in the choice of 
Tpb2p cleavage sites [33] or to the participation of different 
mechanisms in the formation of lES excision junctions, such as C- 
NHEJ-mediated DSB repair [32] or trans-esterification if a single 
boundary is cleaved [34,35]. Accordingly, T. thermophila appears 
to have avoided lES insertion into coding regions [36], where 
imprecise excision could be deleterious. 

The situation is quite different in P. tetraurelia, in which 47% of 
genes are interrupted by at least one lES [11]. The pressure to 
assemble functional open reading frames in the somatic genome 
has driven the emergence of a highly efficient and precise lES 
excision mechanism. Several known features of the excision 
process contribute to this precision: (i) the establishment of a 
crosstalk between lES ends before DNA cleavage, within a 
transpososome-like complex [11,24], and (ii) the precise position- 
ing of Pgm-dependent cleavages on each flanking TA [17]. Here, 
we show that the development-specific Ku70/Ku80c heterodimer 
is required to activate Pgm-dependent DNA cleavage at lES 
boundaries in vivo. Although we excluded an inhibitory effect of 
Ku80c depletion on PGM transcription, we cannot formally rule 
out that a Ku-dependent upstream developmental event activates 
DNA cleavage. However, our observation that Ku70a and Ku80c 
interact with Pgm in puU-down experiments suggests that DNA 
cleavage and C-NHEJ-mediated DSB repair are tightiy inter- 
twined through the incorporation of the development-specific 
Ku70a/Ku80c heterodimer into the DNA cleavage complex itself 
(Figure 8): programmed DSBs, therefore, would be efficiently 
directed towards C-NHEJ-mediated precise repair. We propose 
that the Ku70/Ku80c heterodimer is an essential Pgm partner 
that activates the assembly of a transpososome-like nucleoprotein 
complex competent for DNA cleavage. In support to this model, 
we found that Ku70a and Ku80c interact with Pgm in cell 
extracts. During lES excision, Ku and Pgm may associate once 
Pgm is bound to lES boundaries (Figure 8A: activation of Pgm- 
dependent cleavage), or before Pgm interacts with DNA, as 
suggested by our observation that the Pgm/Ku complex is 
resistant to DNase I in vitro (Figure 8B: activation of the DNA 
binding activity of Pgm). Further biochemical work is needed to 
analyze the Ku/Pgm interaction and clarify the mechanisms 
involved in Ku-mediated activation of DNA cleavage. We cannot 
exclude at this stage that the DNA-PKcs is also a component of the 
DNA cleavage complex (Figure 8): indeed, a parallel study 
established that DNA-PKcs depletion impairs the excision of at 
least a subset of lESs, which exhibit a reduced efficiency of DNA 
cleavage at their boundaries (Malinsky et al., in prep.). The 
incorporation of early C-NHEJ proteins in the cleavage complex 
would allow the cell to face the challenge of repairing efficiently 
and precisely tens of thousands of DSBs during the short period of 
MAC development. In contrast, Ligase IV and Xrcc4 are clearly 
not required for DNA cleavage [16] and are probably not part of 
the cleavage complex. 



A 




Figure 8. Models for the assembly of an active DNA cleavage 
complex. (A) Ku associates with DNA-bound Pgm and activates DNA 
cleavage. Pgm and its putative partners (in grey) would recognize and 
bind the boundaries of eliminated sequences. The binding of Ku (in 
blue) activates Pgm for DNA cleavage (symbolized by the switch from a 
rectangular box to an oval), perhaps by assisting the formation of a 
synapse between both lES ends, in a transpososome-like intermediate. 
The DNA-PKcs catalytic subunit (in peach) may also be part of the active 
DNA cleavage complex. (B) Ku forms a complex with Pgm in the 
absence of DNA, activating the DNA binding and/or cleavage activities 
of Pgm. Following DNA cleavage, conformational remodeling of the 
complex would position Ku on broken DNA ends and allow it to 
perform its classical role in C-NHEJ-mediated DSB repair. MAC DNA is 
represented in black, lES DNA in red. 
doi:10.1371/journal.pgen.l004552.g008 

Integration of DNA cleavage and repair in recombination 
factories during PGR 

Domesticated transposases from cut-and-paste DNA transpo- 
sons might have been recruited to perform PGR in various 
systems, not only because of their DNA cleavage activities, but 
perhaps also because of the particular features of cut-and-paste 
transposition [37]. When transposons integrate into their target 
site, they duphcate a short sequence, the TSD (target site 
duplication), on each side of the integrated element. During the 
next round of transposition, the transposase cuts the DNA, excises 
the transposon and leaves a DSB at the donor site, which is 
repaired through cellular pathways. The study of cut-and-paste 
transposons has revealed that C-NHEJ, as opposed to alt-NHEJ, 
accounts for most end-joining events during DSB repair at 
transposon donor sites (reviewed in [5]). Moreover, several cut- 
and-paste transposons and/ or their transposases interact with Ku. 
For instance, Ku70 binds the ends of the P element from 
Drosophila melanogaster and stimulates DSB repair at the donor 
sites [38]. In vitro, the transposase of Sleeping Beauty, a 
reconstructed Tc/mariner transposon, forms a complex with 
Ku70, and efficient transposition of Sleeping Beauty in a cellular 
system depends on the presence of DSB repair proteins [39]. 
However, it is not clear in vivo whether Ku/transposase 
interaction activates DNA cleavage at transposon ends or simply 
facilitates the recruitment of the C-NHEJ pathway once the donor 
site has been broken. 

With regard to PGR, the formation of a complex involving a 
nuclease and DSB repair factors has been hypothesized during 
V(DJ recombination [40]. In vitro, Ku interacts with the 
domesticated transposase RAGl [41], but no evidence has been 
provided that Ku is required in vivo for the introduction of RAGl - 
dependent programmed DSBs. As in Tetrahymerm, the formation 
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of nuclear recombination centers, or recombination factories, may 
facilitate V(D)J recombination by bringing together the recombi- 
nation sites, the RAG1/RAG2 endonuclease and DSB repair 
factors. However, in this system, the presence of C-NHEJ proteins 
appears dispensable for DNA cleavage itself The interplay 
between DNA cleavage and DSB repair has been pushed one 
step further in Paramecium. Indeed, the present study of lES 
excision in P. tetraurelia provides the first evidence that Ku is 
absolutely required in vivo to introduce programmed DSBs during 
PGR. The demonstration that DNA cleavage mediated by a 
transposase-related protein and C-NHEJ mediat(-d r("pair are 
coupled during PGR in P. telraurelia is reminiscent of the 
observation that, during meiosis in S. cerevisiae, the Mrellp HR 
protein is required for DNA cleavage by the topoisomerase-like 
Spoil endonuclease [42]. These two systems support the notion 
that the presence of DSB repair factors in recombination factories 
may be a prerequisite for DNA cleavage during programmed 
genome rearrangements. 

Materials and Methods 

P. tetraurelia strains and growth conditions 

For autogamy time-course experiments, we used P. tetraurelia 
strain 51 new (hereafter called 51) and its 51AA variant carrying a 
heritable deletion of the A gene in its MAC but harboring a wUd- 
type MIC [24]. To facilitate the screening of transformants in 
microinjection experiments, we introduced the nd7-l mutation 
[43] into strain 51 by conjugation, or used somatic variants 
carrying a MAC deletion of the ND7 gene [44]. Cells were grown 
at 27''C in a wheat grass infusion (\VGP; Pines International Inc.) 
inoculated with Klebsiella pneumoniae. Autogamy was carried out 
through starvation as described [29]. Total RNA and genomic 
DNA were extracted from ~400,000 cells for each time-point and 
quantified as described [15]. 

Molecular procedures 

Oligonucleotides were purchased from Sigma-Aldrich or 
Eurofms MWG Operon (Table SI). 

PGR amplifications were performed in a final volume of 25 |J,L, 
with 10 pmol of each primer, 5 nmol of each dNTP and 1 U of 
DyNAzyme 11 DNA polymerase (Finnzymes) or DreamTaq 
(Thermo Scientific) according to the enzyme suppliers' recom- 
mendations. PGR products were analyzed on 3% NuSieve GTG 
agarose gels (BioWhittaker Molecular Applications). LMPGR 
detection of double-strand breaks was performed as described 
[24]. Poly(C) tailing office 3' ends using terminal transferase was 
performed as described [17], using 500 ng of input total genomic 
DNA. Sanger DNA sequencing was performed at GATC Biotech, 
or using the fmol DNA Cycle Sequencing System (Promega). 
Northern and Southern blot hybridization with ^^P-labeled probes 
was carried out as described [15]. The KU70 and KU80c probes 
are described in Figure S2 and Table SI. The sequence of the 17S 
rRNA oligonucleotide probe is shown in Table SI. 

Protein depletion by RNAi 

Description of RNAi plasmids. All RNAi plasmids are 

derivatives of vector L4440 [45] and carry a target gene fragment 
between two convergent T7 promoters. Each insert was chosen in 
order to minimize the risk of cross-silencing by using the "RNAi- 
ofi'-target" tool of ParameciumDB [46]. Plasmids pXRCC4-R 
[16], pPGM-1 [15], pOND7c [44] and pIGL7a [47] carry RNAi 
inserts targeting the XRCC4, PGM, ND7 and ICL7a genes, 
respectively. RNAi plasmids targeting the KU genes were 
constructed as follows: 



pL4440-KU70a-l: a 300-bp fragment from gene KU70a (bp 
514—813 from ATG start codon) was amphfied by PGR using 
primers ku70A-BamHl_l and ku70A-KpnI_l, digested with 
BamHI and Kpnl, and ligated between the Bglll and Kpnl sites 
of plasmid L4440. 

pL4440-KU80a-l: a 233-bp fragment from gene KU8()a (bp 
40-272 from ATG start codon) was amplified by PGR using 
primers ku80A-BamHl_l and ku80A-KpnI_l, digested with 
BamEl and Kpn\, and ligated between the Bglll and Kpnl sites 
of plasmid L4440. 

pL4440-KU80a-2: a 450-bp fragment from gene KUHOa (bp 
580-1029 bp from ATG start codon) was amplified by PGR using 
primers OMB223 and OMB224, digested with Spel and ligated 
into the Xbal site of plasmid L4440. 

pL4440-KU80b-l: a 233-bp fragment from gene KU80b (bp 
40-272 from ATG start codon) was amplified by PGR using 
primers OMB219 and OMB220, digested with Spel and ligated 
into the Xbal site of plasmid L4440. 

pL4440-KU80b-2: a 450-bp fragment from gene KUSOb (bp 
580-1029 bp from ATG start codon) was amplified by PGR using 
primers OMB221 and OMB222, digested with Spel and ligated 
into the Xbal site of plasmid L4440. 

pL4440-KU80c-l: a 206-bp fragment from gene KUSOc (bp 
1021-1226 from ATG start codon) was amplified by PGR using 
primers ku80C-BamHl_l and ku80C-Kpnl_lbis, digested with 
BamUl and Kpml, and ligated between the BglTl and Kpnl sites of 
plasmid L4440. 

pL4440-KU80c-2: a fragment from gene KUSOC was amplified 
by PCR using primers OMB225 and ku8()C-KpnI_lbis. Follow- 
ing restriction with Spel, a 450-bp sub-fragiuent (bp 557-1006 
from ATG start codon) was inserted into the Xbal site of plasmid 
L4440. 

Outline of the "feeding" procedure. RNA interference 
was achieved as described [15], by feeding P. tetraurelia strains 51 
or 51AA on Escherichia coli HT115 bacteria transformed with 
each plasmid and induced for the production of double-strand 
RNA corresponding to each RNAi insert. Survival of the progeny 
was tested at day 4 of starvation by transferring 30 individual 
autogamous cells to standard K. pneumoniae medium. 

Determination of gene expression levels by RNA-seq 

Strand-specific RNA-seq libraries were prepared from 50 ng 
polyA+ RNA following the directional mRNA-seq library 
preparation protocol provided by lUumina: RNAs were fragment- 
ed using fragmentation buffer, purified and treated with phospha- 
tase and kinase prior to sequential ligation with different RNA 
adapters to the 3' and 5' ends. The hgated RNA fragments were 
reverse-transcribed, followed by PCR amplification. Each library 
was sequenced using an lUumina Genome Analyzer Tlx to 
generate 75-nt paired-end reads. Reads were mapped with 
TopHat2 [48] (read-mismatches 1; min-intron-length 15; max- 
intron-length 100) on the P. tetraurelia strain 51 reference MAC 
genome [1 1]. Alignments were indexed using Samtools [49] and a 
custom perl script was used to count uniquely mapped fragments 
for each gene model. The counts were normalized to account for 
gene size and the total number of mapped fragments. 

Construction of GFP fusions 

For the construction of in-frame GFP-KU fusions, a GFP- 
coding fragment adapted to Paramecium codon usage [50] was 
added by PGR fusion to the 5' end of the KU70a or KUSOc genes. 
Each construct was inserted in a pUC 1 8 plasmid between the SphI 
and Sad sites. As a result, the GFP is fused to the N-terminus of 
Ku70a and Ku80c and the fusion proteins are expressed under the 
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control of the KU70a and KU80c transcription signals (promoters 
and 3'UTR), respectively. The PGM-GFP fusion will be described 
in detail elsewhere (Dubois et al., in prep.). Briefly, the GFP- 
coding fragment was added by PGR fusion to the 3' end of the 
PGM gene carried by plasmid pPBL49g [15]: in the resulting 
construct, the PGM-GFP coding sequence is flanked by 96 bp 
upstream of the ATG (i.e. the putative endogenous PGM 
promoter) and 54 bp downstream of the TGA stop codon 
(including the endogenous 3' UTR and the polyadenylation 
site). 

Transformation of Paramecium cells and localization of 
fluorescent fusion proteins 

Plasmids encoding GFP fusion proteins were linearized by 
appropriate restriction enzymes and microinjected into the 
MAG of vegetative 51 nd7-l or 51AND7 cells, as described 
[15]. A complementing plasmid carrying a functional ND7 
gene [43] was coinjected with the fusion transgenes to facilitate 
the selection of transformants. During autogamy, cells were 
permeabilized for 4 min in PHEM (60 mM Pipes, 25 mM 
Hepes, 10 mM EGTA, 2 mM MgGlj pH 6.9) +1% Triton, 
then fixed for 10 min in PHEM +2% paraformaldehyde. All 
observations were performed using a Zeiss Axioplan 2 Imaging 
epifluorescence microscope. Developing MAGs were identified 
using Nomarski differential interference contrast (DIG) com- 
bined with 4',6-diamidino-2-phenylindole (DAPI) staining. No 
lethality was observed in the post-autogamous progeny of 
transformed cells. 

Protein expression in insect cells and co-precipitation 

assays 

Plasmids and vectors. DNA sequences coding for Pgm, 
Ku70a and Ku80c were obtained by gene synthesis (DNA 2.0 or 
Eurofms MWG/Operon). The synthetic PGM DNA sequence 
was first cloned into the pMAL-c2x vector (New England Biolabs). 
The MBP-PGM fusion, the MBP, the KU70a and the KU80c 
sequences were further introduced into the pVL-1392 vector (BD 
Biosciences). A HA tag was added to the G-terminus of Ku70a and 
the N-terminus of Ku80c. All plasmid sequences are provided in a 
supplementary information file (Text SI). Plasmids pVL1392- 
MBP-PGM, pVL1392-MBP, pVL1392-KU70a-HA and 
pVL1392-HA-KU80c were transfected individually into High 
Five cells together with the BD BaculoGold Linearized Baculo- 
virus DNA (BD Biosciences). 

Preparation of soluble protein extracts from baculovirus- 
infected High Five cells. High Five cells were grown in the 
EX-GELL 405 synthetic medium (Sigma Aldrich) supplemented 
with Penicillin and Streptomycin. For co-expression experiments, 
10*^ ceUs are infected with MBP, MBP-PGM, KU70a-HA and/or 
HA-KU80c recombinant baculoviruses. In this system, each gene 
is expressed under the control of a late viral gene. At 48 hrs post- 
infection, cells were collected, washed with cold PBS buffer and re- 
suspended in 500 nl of lysis buffer containing 10 mM Tris-HGL 
pH 7.5, 150 mM NaGl, 1% Triton X-100(v/v), 1 mM EDTA, 
1 mM DTT and 0.5% NaDesoxycholate (m/v) in the presence of 
protease inhibitors (Gomplete Protease Inhibitor GocktaU Tablets, 
Roche). Gells were lysed for 20 min on a rotating wheel, then 
centrifuged at 4°C for 15 min at 10,000 g. 

Co-precipitation assays. For MBP-tagged proteins, soluble 
extracts were incubated for 2 hrs with 100 |J.g of amylose magnetic 
beads (New England Biolabs). For HA-tagged proteins, 1.5 |ig of 
monoclonal anti-HA antibodies (HA-7 from Sigma Aldrich) were 
incubated overnight on a rotating wheel at 4°G with 10 (xl of 
protein G magnetic beads (New England Biolabs), then cell 



extracts were incubated for 2 hrs with the coated beads. In both 
experiments, the beads were washed 3 times with 1 ml of lysis 
buffer, then re-suspended in Laemmli buffer before electrophoresis 

in SDS-polyacrylamide gels. Alirjuots were saved for input control. 
HA-tagged Ku proteins were detected on western blots, using 
monoclonal anti-HA primary antibodies (same as above) and anti- 
mouse HRP-coupled secondary antibodies (Promega). MBP- 
tagged proteins were detected with anti-MBP HRP-coupled 
antibodies (New England Biolabs). Pgm was detected using 
polyclonal anti-Pgm rabbit primary antibodies (Proteogenics) 
raised against tiie DKNEKDAEDEFCiDLNPSEHK peptide from 
the N-terminus of Pgm, and an anti-rabbit HRP-coupled 
secondary antibody (Promega). 

Supporting Information 

Figure SI Progression of autogamy in the cultures used in this 
study. For each time-point, cell stages were monitored by DAPI 
staining. Veg: vegetative cells. Skeins: cells with elongating old 
MAG at the beginning of fragmentation. Meiosis*: cells with 
detectable meiotic MIGs (which is an underestimate of the actual 
fraction of meiotic cells). Fragments: cells with fully fragmented old 
MAG, but no visible new MAG (too small or indistinguishable 
from fragments). 2 anlagen: cells with two visible new developing 
MAGs. postA: post autogamous cells with one new MAG and a 
few fragments of the old MAG. (A) Autogamy time course of strain 
5 1 subjected to RNAi against ND7 or PGM. The ND7 and PGM- 
1 constructs used for RNAi were described in [15]. (B) Autogamy 
time course of strain 5 1 subje[:ted to RNAi against ND7 or KUSOc 
(using the KU80c-2 [:onstruct, see Figure S2). (G) Autogamy time 
course of strain 51AA subjected to RNAi against ICL7a (RNAi 
construct described in [47]) or KVSOc (RNAi construct: KU80c- 
2). Note that in the ICL7 RNAi, a fraction of cells underwent 
autogamy prematurely, accounting for the presence of 20% of cells 
with fragmented MAGs in time-point 1 (VIcl7) and for the 
detection of 10% post-autogamous cells (postA) in the TO sample. 
Taking into account the asynchrony of the ICL7 RNAi 
experiment, the corresponding stages between the IGL7 and 
KUBOc RNAi experiments were numbered from 1 to 6 
(intermediate stages were labeled with a'). 
(TIF) 

Figure S2 Maps of the KU70 and KU80 genes of P. tetraurelia. 
The maps show the position of the inserts used for RNAi 
constructs (blue) and hybridization probes (red, see Table SI). For 
KU70 genes, aU fragments were designed from the KU70a 
sequence. Specific fragments were designed for each KU80 gene: 
all experiments described in the paper were performed using 
inserts KU80-a2, KU80-b2 and KU80-c2. Qualitatively similar 
phenotypes (survival/lethality in sexual progeny and presence/ 
absence of de novo lES excision junctions) were observed when 
RNAi was performed using inserts KU80-al (which also targets 
KV80b mRNA) and KU80-cl. 
(TIF) 

Figure S3 Detection of de novo lES excision junctions in 
different RNAi conditions. The excision of lESs 51A1835, 

51A2591 and 51A4404 during autogamy of strain 51AA (4 days 
of starvation) was tested by PGR amplification (PGR primers are 
displayed in Table SI). RNAi conditions are indicated next to each 
panel. Here, starvation was prolonged for one additional day 
relative to the other experiments shown in the paper. Therefore, 
day 4 corresponds to approximately 20 hours following time-point 
6 in Figure 4. The low levels of IES~ forms that are detected very 
late during autogamy might be attributable to the recovery of 
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significant amounts of full-length KU80c mRNA upon prolonged 

starvation (see Figure 3C and related text). 

(TIF) 

Figure S4 Search for alternative excision junctions for lES 
51A2591. (A) Map of lES 51A2591 and its flanking MAC-destined 
sequences. The primers used for PGR reactions are indicated by 

arrowheads (see Table SI). (B) No detection of alternati\'e de novo 
lES excision junctions during autogamy of .51AA cells, in an ICL7 
(control) or a KU80c RNAi. PGR reactions were performed using 
primers OMB808 and OMB809 and loaded on a 2% agarose gel. 
(TIF) 

Figure S5 Alignments of the Ku70 and Ku80 homologs from P. 
tetraurelia. All sequences were retrieved from the Parame- 
ciumDB database, using the following accession numbers: 
GSPATP00006445001 (Ku70a), GSPATP00009747001 (Ku70b), 
GSPATP00034664001 (Ku8()a), GSPATP00035446001 (Ku80b), 
GSPATP000300950()1 (Ku8()( ). Alignments of Ku80 (jianel A) and 
Ku70 homologs (panel B) were performed separately using the 
MUSCLE multiple sequence alignment software (http://www.ebi. 
ac.uk/Tools/ msa/ muscle/) and colored using the BoxShade server 
(http://www.ch.embnet.org/soltware/BOX_form.html). For each 
Ku subunit, domain annotation was based on Pfam analysis of 
conserved domains (http://pfam.xfam.org/ and [52]). 
(PDF) 
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